A comparison of partial least squares (PLS) and sparse PLS regressions in genomic selection in French dairy cattle.
نویسندگان
چکیده
Genomic selection involves computing a prediction equation from the estimated effects of a large number of DNA markers based on a limited number of genotyped animals with phenotypes. The number of observations is much smaller than the number of independent variables, and the challenge is to find methods that perform well in this context. Partial least squares regression (PLS) and sparse PLS were used with a reference population of 3,940 genotyped and phenotyped French Holstein bulls and 39,738 polymorphic single nucleotide polymorphism markers. Partial least squares regression reduces the number of variables by projecting independent variables onto latent structures. Sparse PLS combines variable selection and modeling in a one-step procedure. Correlations between observed phenotypes and phenotypes predicted by PLS and sparse PLS were similar, but sparse PLS highlighted some genome regions more clearly. Both PLS and sparse PLS were more accurate than pedigree-based BLUP and generally provided lower correlations between observed and predicted phenotypes than did genomic BLUP. Furthermore, PLS and sparse PLS required similar computing time to genomic BLUP for the study of 6 traits.
منابع مشابه
High dimensional classification with combined adaptive sparse PLS and logistic regression
Motivation The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection, which combined constitute a powerful framework for classification, as well as data visualization and interpretation. However, current proposed combinations le...
متن کاملPartial least squares: a versatile tool for the analysis of high-dimensional genomic data
Partial least squares (PLS) is an efficient statistical regression technique that is highly suited for the analysis of genomic and proteomic data. In this article, we review both the theory underlying PLS as well as a host of bioinformatics applications of PLS. In particular, we provide a systematic comparison of the PLS approaches currently employed, and discuss analysis problems as diverse as...
متن کاملRapid and Simultaneous Determination of Montelukast, Fexofenadine and Cetirizine Using Partial Least Squares and Artificial Neural Networks Modeling
Simultaneous determination of pharmaceutical compounds and accurate quantitative prediction of them are of great interest in the clinical and laboratory-based investigations.This work has focused on a comprehensive comparison of Partial Least-Squares (PLS-1) and Artificial Neural Networks (ANN) as two powerful types of chemometric methods. For this purpose, montelukast (MONT), fexofenadine ...
متن کاملGlobally sparse PLS regression
Partial least squares (PLS) regression combines dimensionality reduction and prediction using a latent variable model. It provides better predictive ability than principle component analysis by taking into account both the independent and response variables in the dimension reduction procedure. However, PLS suffers from over-fitting problems for few samples but many variables. We formulate a ne...
متن کاملDiagnosis and prognosis of osteoarthritis by texture analysis using sparse linear models
We present a texture analysis methodology that combines uncommitted machine-learning techniques and sparse feature transformation methods in a fully automatic framework. We compare the performances of a partial least squares (PLS) forward feature selection strategy to a hard threshold sparse PLS algorithm and a sparse linear discriminant model. The texture analysis framework was applied to diag...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of dairy science
دوره 95 4 شماره
صفحات -
تاریخ انتشار 2012